Corpus: ind_news_2008_30K

Other corpora

5.1.18 Words nearly always as next neighbors

Strong NN co-occurrences with a low probability of being separated

The quotient below is calculated as freq(word1)*freq(word1)/NN_freq^2.

Word 1 Word 1 Frequency of word 1 Frequency of word 2 Frequency as NN Qoutient
New York 119 92 89 1.38
Gus Dur 91 78 77 1.20
Rumah Sakit 62 43 43 1.44
Dow Jones 32 35 28 1.43
Wall Street 26 32 26 1.23
Kuala Lumpur 29 31 27 1.23
light sweet 24 24 21 1.31
Bung Karno 23 23 20 1.32
SEA Games 16 20 16 1.25
Dalai Lama 14 20 14 1.43
Astana Giribangun 14 18 14 1.29
Asasi Manusia 13 18 13 1.38
buy back 15 18 15 1.20
Bangka Belitung 18 14 13 1.49
Sjamsul Nursalim 12 14 11 1.39
Los Angeles 13 13 13 1.00
terumbu karang 11 13 10 1.43
Retail Banking 9 12 9 1.33
Ihza Mahendra 10 12 10 1.20
puting beliung 12 12 12 1.00
80 msec needed at 2018-03-09 10:57